Multi-agent adaptive routing by multi-headattention-based twin agents using reinforcement learning
Annotation
A regular condition, typical for packet routing, for the problem of cargo transportation, and for the problem of flow control, is the variability of the graph. Reinforcement learning based adaptive routing algorithms are designed to solve the routing problem with this condition. However, with significant changes in the graph, the existing routing algorithms require complete retraining. To handle this challenge, we propose a novel method based on multi-agent modeling with twin-agents for which new neural network architecture with multi-headed internal attention is proposed, pre-trained within the framework of the multi-view learning paradigm. An agent in such a paradigm uses a vertex as an input, twins of the main agent are placed at the vertices of the graph and select a neighbor to which the object should be transferred. We carried out a comparative analysis with the existing DQN-LE-routing multi-agent routing algorithm on two stages: pre-training and simulation. In both cases, launches were considered by changing the topology during testing or simulation. Experiments have shown that the proposed adaptability enhancement method provides global adaptability by increasing delivery time only by 14.5 % after global changes occur. The proposed method can be used to solve routing problems with complex path evaluation functions and dynamically changing graph topologies, for example, in transport logistics and for managing conveyor belts in production.
Keywords
Постоянный URL
Articles in current issue
- Pulse recording of dynamic holograms in bismuth silicate crystal in a broad wavelength range
- Hybrid endoscope with television and multispectral image processing for the internal organs cancer early diagnostics
- Modelling of a composite waveguide holographic display
- Application of infrared spectroscopy methods in studying compositions for paper sizing
- Distribution optimization method of pixel density by surveillance area
- Evaluation and development of a method for compensating the positioning error of computer numeric control equipment
- Compensation of output external disturbances for a class of linear systems with control delay
- Luminescence technique for studying the growth of AgInS2 quantum dots
- Peculiarities of pulsed laser deposition of thin InGaAsN films in an active background gas atmosphere
- Determination of the electron distribution in thin barrier AlGaAs/GaAs superlattices by capacitance-voltage profiling
- Spectral and kinetic properties of silver sulfide quantum dots in an external electric field
- Influence of nano-sized horizontal inhomogeneities on surface profiling by means of XPS
- Organic light-emitting diodes with new dyes based on coumarin
- Fabrication and characterization of hybrid composite of Al6082/SiC/rice husk powder using friction stir processing
- A multi-path secure routing for the detection of node capturing attack in wireless sensor network
- A method for documenting architectural solutions of computing platforms
- Improving out of vocabulary words recognition accuracy for an end-to-end Russian speech recognition system
- Method for monitoring the state of elements of cyber-physical systems based on time series analysis
- Application of the text wave model to the sentiment analysis problem
- Automated evaluation of ECG parameters during the COVID-19 pandemic
- Joint learning of agents and graph embeddings in a conveyor belt control problem
- Simulation of radiative transfer in gas-liquid foams
- The effect of signal-to-noise ratio value on the error in measuring acoustic emission parameters: statistical assessment
- Simulating the process of steady-state thermoreflectance for measuring the thermal conductivity of materials
- Modeling and simulation ofone- and two-row six-bladed ducted fans
- Differential-difference model of heat transfer in solids using the method of parametric identification